A Formal Presentation of MongoDB (Extended Version)

Authors

  • Elena Botoeva
  • Diego Calvanese
  • Benjamin Cogrel
  • Martín Rezk
  • Guohui Xiao
Abstract

A significant number of database architectures and data models have been proposed during the last decade. While some of these new systems have gained in popularity, their formal semantics are generally still missing. In this paper, we consider the symptomatic case of MongoDB, a widely adopted document database in which, roughly speaking, relational tables correspond to collections and tuples to documents. We provide a formalization of the JSON-based data model adopted by MongoDB, and of a core fragment of the MongoDB aggregation query language, mupgl, which includes the match, unwind, project, group, and lookup operators. We study the expressiveness of mupgl by defining a relational view of MongoDB databases and developing a translation from relational algebra to mupgl. Notably, we show that the mupg fragment is already at least as expressive as full relational algebra over (the relational view of) a single collection, and in particular is able to express arbitrary joins. We further investigate the computational complexity of mupgl and of significant fragments of it.
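To make the five operators named in the abstract concrete, the following is a minimal toy sketch in Python that treats a collection as a list of dictionaries. It is an illustration only: the function names, their simplified behavior, and the sample data are assumptions made here, not the formal semantics defined in the paper and not MongoDB's actual implementation. The join at the end goes through lookup across two collections; it does not reproduce the paper's encoding of joins in the mupg fragment.

```python
from collections import defaultdict

def match(docs, pred):
    """Keep the documents that satisfy a predicate (toy analogue of match)."""
    return [d for d in docs if pred(d)]

def unwind(docs, field):
    """Emit one copy of each document per element of its array field.
    Documents without the field are dropped in this simplified version."""
    return [{**d, field: item} for d in docs for item in d.get(field, [])]

def project(docs, fields):
    """Keep only the listed top-level fields (toy analogue of project)."""
    return [{f: d[f] for f in fields if f in d} for d in docs]

def group(docs, key, field):
    """Group by `key` and collect the values of `field` into a list."""
    buckets = defaultdict(list)
    for d in docs:
        buckets[d[key]].append(d[field])
    return [{"_id": k, field: v} for k, v in buckets.items()]

def lookup(docs, other, local, foreign, as_field):
    """Attach to each document the array of documents in `other`
    whose `foreign` field equals the document's `local` field."""
    return [
        {**d, as_field: [o for o in other if o.get(foreign) == d.get(local)]}
        for d in docs
    ]

# Hypothetical sample collections (documents play the role of tuples).
orders = [
    {"_id": 1, "customer": "ann", "items": ["pen", "ink"]},
    {"_id": 2, "customer": "bob", "items": ["pad"]},
]
customers = [{"name": "ann", "city": "Rome"}, {"name": "bob", "city": "Oslo"}]

# Flatten the nested arrays, then regroup them by customer.
flat = unwind(orders, "items")
regrouped = group(flat, "customer", "items")

# A join-like pipeline: match, lookup, unwind the joined array, project.
ann_orders = match(orders, lambda d: d["customer"] == "ann")
joined = unwind(lookup(ann_orders, customers, "customer", "name", "info"), "info")
result = project(joined, ["_id", "info"])
```

In this sketch, lookup followed by unwind produces one output document per matching pair, which is the shape a relational join would give over the flattened view of the two collections.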

Similar resources

Automated Sharded MongoDB Deployment and Benchmarking for Big Data Analysis

Project CH-818664, KVM: Using Python, Ansible, Bash Shell, and Cloudmesh Client a fully automated process is created for deploying a configurable MongoDB sharded cluster on Chameleon, FutureSystems, and Jetstream cloud computing environments. A user runs a single Python program which configures and deploys the environment based on parameters specified for numbers of Config Server Replicas, Mong...

Apply Uncertainty in Document-Oriented Database (MongoDB) Using F-XML

As we move into a big-data world where unstructured data is growing at high velocity, there is a need for a data store that can hold this volume of data. Relational databases, the traditional choice, are not well suited to handling data at this scale, so a move to non-relational data stores is needed. In the current study, we have proposed an extension of the Mongo...

On the Expressiveness and Complexity of MongoDB

A significant number of database architectures and data models have been proposed during the last decade. While some of these new systems have gained in popularity, their formal semantics are generally still missing. In this paper, we consider the symptomatic case of MongoDB, a widely adopted document database, in which roughly speaking relational tables correspond to collections, and tuples to...

Expressivity and Complexity of MongoDB Queries

In this paper, we consider MongoDB, a widely adopted but not formally understood database system managing JSON documents and equipped with a powerful query mechanism, called the aggregation framework. We provide a clean formal abstraction of this query language, which we call MQuery. We study the expressivity of MQuery, showing the equivalence of its well-typed fragment with nested relational a...

Journal:
  • CoRR

Volume abs/1603.09291

Publication date 2016